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Abstract A novel coronavirus strain was isolated from laryngotracheal swab of wild partridge and designated as partridg¢é] GI] 
S14]2003] S140]. Its whole genomic sequence was obtained] GenBank Accession number[] AY646283[] through RT-PCR amplification[] 
cloning{] nucleotide sequencing[] and analysis by the DNASTAR program. To investigate the origin of the virusl] we further analyzed the 
nucleotide sequences of the main structural proteins[] and compared those with other available virus isolates. Our results showed that the 
highest nucleotide homologies between the S1 gene of S14 strain and those of nephrogenic-type strains JX1-99 and TJ2-96 were 94.6% 
and 93.4%[] respectively. In addition[] a relatively high genetic identity[] 85% and 84.3%{] respectivelyL] was detected between S1 gene 
of S14 and those of strains QXIBV and LX4. The results suggested that the S14 strain may be originated from or related to nephrogenic- 
type and proventriculus-type infectious bronchitis viru¢{] IBVL]. The highest nucleotide homology between the S2 gene of S14 strain and 
those of QXIBV and LX4 was 85% and 84.3%[] respectively and all of them belonged to group II coronaviruses. The highest nucleotide 
homology between the M gene of S14 strain and those of strains SAIB20 and GD6-98 was 90.6% and 90.2 %{[] respectively by which S14 
belonged to group III. Although they displayed high level of genetic identity in S1 and S2 gene[] there was lower homology of M coding se- 
quences between S14 and BJ[] and between S14 and QXIBV strains. Phylogenetic analysis of N gene indicated that group I strains might 
evolve from RNA recombination between strain H52 and Gray] while group II strains from strain H120 and D1466. S14 strain had the 
highest N gene homology with strain QXIBV which was 95.7%[] thus classified as a group II] member. Strains SAIB20 and GD6-98 
which were closely related to the M gene of S14 strain belonged to group I and IV[] respectively. A possible role of partridge S14 strain 
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may play in the process of coronavirus evolution is discussed. 


Coronavirus exists widely in various kinds of do- 
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zmestic and wild animals. They often result in respira- 
Btory or intestinal illness including vomit[] diarrhea and 
‘Zeven death of the infected animal! 4. In this study[] 
one wildbird coronavirus was isolated and character- 
ized from a total of 1510 laryngotracheal swabs and 
883 tissue samples[] which were collected from wild 
poultry distributed in various areas of Guangdong 
SProvince[] China. This strain was closely related to a- 
Avian infectious bronchitis virus but distantly to SARS- 
CoV by phylogenetic comparison with viruses using 
genomic sequence data available in GenBank™. 


1 Materials and methods 
1.1 Materials 


Viruses|] Partridge coronavirus[] designated as 
S14{] was isolated and identified by the Key Laborato- 
ry of Poultry Feeding and Diseases Controll] Ministry 
of Agriculture[] China. 


Keywords[]__ wild bird[] coronavirus[] structural protein genes[] virus evolution. 


Reference strainsL] Respiratory-type strains in- 
cluded M41 [J] Sl M21883{] S2[] X04722[] MU 
AF286184L] NL] M28566L[] Ark99[] SL] L10384UNU 
M85244L]] SD1-97]] AY325732 [J] DEO72 0 MU 
AF202999[ INL] AF203001L1] D1464] SE] M219710M{] 
AF203004L] NU] AF203006 [1] HaNi1-95 [J SIU 
AY251817]] MU AY325734 [L HaN2-95 

LW) AY325728 []] GxX1-98 [|] SIU) AY319302[] MU 
AF325727{1] NI} AY278110 LT] GxX2-98 [] SIU 
AY251816]] MU) AY325733[] NE] AY251816[]] H120 

L) SO) M21970]] MU] AY028295[] NU] AY028296[]] H52 

L} SL) AF352315—{] ML] AF286185[]. CU-T20 SJ 
U04739[] MUL] U46035L] NE] U04803[] was a respirato- 
ry mutant type[] nephrogenic-type strains included 
JX1-99]] AF210735L[] SAIBwjL] AF397528[]] Holte 

L] SL] L18988[] AF334685L]] Gray] SIL] L14069]] 820] 
AF394180 [1] KB8523 [] M21515({7] X J SIL 
AF427819[] NEU] AY04331500BJU SL AY3196551[]. 
Proventriculus-type strains included Ql 


[] AF286302[0 J20 AF352312(1] T30 AF227438[1] 
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QXIBV |] AF193423[[] LX4]} SJ] AY185197]] NU 
AY338732L1] ZJ971L] SL] AF352311L] NU] AF352308[]. 


Specific-pathogen-free [] SPF [ 
chicken eggs[] Nine-day-old SPF embryonated chicken 
eggs were kindly provided by Experimental Station of 
Veterinary Medicinel] South China Agricultural Uni- 
versity. 


embryonated 


Total RNA Extraction Kit III] was purchased 

from Sino-American Biotechnology Co. AMV XL[] 

DNasin{] 3°-Full RACE Core Set[] 5°-Full RACE Core 

=Sei[] pl NL6L] and Ex Tag enzyme were provided by 
STaKaRa Biotechnology] Dalian[] Co. L] Ltd. 


3 
El .2. Methods 


oI std and RNA 
extraction S14 strain virus was inoculated to 10- 
“day-old SPF embryonated chicken eggs[] followed by 
Sincubation at 37°C for 48 hours. Chicken embryos 


oe 


that died in 24 hours post-inoculation were discarded. 


Virus propagation 


ara 


Allantoic fluids were collected asepticallyL] centrifuged 
at 4000 rpnt]min at 4°C[] and then virus genomic 
RNA was extracted from the supernatant according to 


the protocols of the Total RNA Extraction Kit III 


L] Sino-American Biotechology Ltd. [. 


1.2.2 RT-PCR amplification] cloning and sequenc- 
RT-PCR amplifica- 


tion and cloning of virus genome segments were car- 


ing of virus genome segments 


ried out according to the methods of Ren et alU ©, 
The extracted viral RNA was desolved in 11.5 pb 
DEPC-treated water and then reversely transcribed. 
The 3’- and 5°-Full RACE were performed according 
to the 


recommendations of the manufacturer 


L] TakaraL]. Positive clones were sent to Shanghai 


BioAsia Biological Technology Co.[] Ltd. for nu- 


cleotide sequencing. 


A total of 29 pairs of primer$] Table 1[] were de- 
signed for cloning and sequencing of genomic seg- 
ments and synthesized by Shanghai BioAsia Biological 


Technology CO. |] Ltd. 
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Table 1. Primer pairs designed 
Forward Reverse bp 
1 5’-CAA GCC TAA AAC AGG GAG TAT C-3’ 5’-GTT AAG TCA TTT CGC ATG CC -3’ 940 
2 5’-GGC ATG CGA AAT GAC TTA AC-3 5’-AGG TCG TCA CAC ATT TTC TC-3’ 941 
3 5’-GAG AAA ATG TGT GAC GAC CT -3’ 5 -CAG AAC TCT TTA AAC TCA TCC A-3° 1421 
4 5’-TGG ATG AGT TTA AAG AGT TCT G-3’ 5’-GAG CGG TAT TTA ACA CCA T-3’ 988 
2) 5’-ATG GTG TTA AAT ACC GCT C-3’ 5’-AGG TTT ATT AAG GTG TIC TG-3’ 800 
6 5’-GCA GAA CAC CTT AAT AAA CCT-3’ 5’-TTG CGC ATA ATA GAT GGC-3’ 677 
7 5’-GCC ATC TAT TAT GCG CAA -3’ 5’-TTT CTC TAT TAA ACC AAG TAG GA-3’ 1034 
8 5’-TCC TAC TTG GTT TAA TAG AGA AA-3’ 5’-ATT ACC AAC AAA CTC ATT GCC-3’ 962 
9 5’-GGC AAT GAG TTT GTT GGT AAT -3’ 5’-TCA GTT CCA GTG TGT AAT GCA -3’ 926 
10 5’-TGC ATT ACA CAC TGG AAC TGA-3’ 5’-TTC AAC ATT AAG CTT AGT CAA A-3’ 1150 
11. 5’-TTT GAC TAA gCT TAA TGT TGA A -3’ 5’-TCC AAA CAA CTG TTG AAT ATG -3’ 1730 
12 5’-CAT ATT CAA CAG TTIG TIT GGA-3’ 5’-GGT CTT TYT CCG TAG TAG GTA TT-3’ 1628 
13. 5’-AAT ACC TAC TAC GGA RAA AGA CC-3’ 5’-TAC TCT TGA TCC CAG TAC TTA AAG-3’ 741 
14 5’-CTT TAA GTA CTG GGA TCA AGA GTA-3’ 5’-GGR TAG CCW GCA CTC TTA TC-3’ 1324 
15 5’-GAT AAG AGT GCW GGC TAY CC-3’ 5’-ACT AGC ATT GTR TGT TGW GAA CA-3’ 453 
16 5’-TGT TCW CAA CAY ACA ATG CTA GT-3’ 5’-CCA AAA ACR GTA CCA TTA GAT AC-3’ 696 
17. 5-GTA TCT AAT GGT ACY GTT TIT GG-3’ 5’-TTT ATC TTA CCA TTA ATA AAK GAC-3’ 1298 
18 5’-GTC MTT TAT TAA TGG TAA GAT AAA-3’ 5’-GTT GTA GCT CTA KAA CCA CAA GA-3’ 481 
19 5-GTT CTT GIG GTT MTA GAG CTA CAA C-3’  5’-TTT GGC AGT GTG CGA ATR TT-3’ 1326 
20 5 -AAY ATT CGC ACA CTG CCA AA-3’ 5’-TGT CTT TTG TIC AGT TTT CAA-3’ 871 
21 5 -TTG AAA ACT GAA CAA AAG ACA-3’ 5’-CAT AAC TAA CAT AAG GGC AAT-3’ 1372 
22 5-CAG TIT GTA GTT TCT GGT GG-3’ 5’-TCA TTC CAC CAT TTW GAC AA-3’ 840 
23. 5'-TTG TCW AAA TGG TGG AAT GA-3’ 5’-ATT CCA ACC GTT CTT AGG-3’ 657 
24 5'-CCT AAG AAC GGT TGG AAT-3’ 5’-TAC TCT CTA CAC ACA CAC -3’ 632 
25. 5'-TGA CCA ARG CGG AAA TAA GA-3’ 5’-AAT GAA RTC CCA ACG GAA AT-3’ 1290 
26 5'-GAT GGT ATA GTG TGG GTT-3’ 5 -CIC:CI€ ATL CAT CIT GIE 3° 1644 
27 5'-GAT GAC AAG ATG AAT GAG GA-3’ 5’-AAG CTA CAT GCC TAT CTT CC-3’ 1010 
28 5 RACE1-F5’-CAT TTA GAA GAC ATC TTT GGT 5’ RACE1-R5’-TGT GAC GTA TAG AAA AAC 604 
GTC-3’ AAA GCG TCA C-3’ 
29 3’RACE-R5'-CTG ATC TAG AGG TAC CGG ATC 332 


3° RACE-F5’-GGA AGA TAG GCA TGT AGC TT - 
z 


C3' 
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Analysis of nucleotide sequence was carried out 
by the BLAST program and compared with published 
relevant sequences available in GenBank!™. A num- 
ber of representative viruses from different geographi- 


cal locations were selected and analyzed by DNAS- 
TAR software package. 


2 Results 
2.1 RT-PCR amplification of S14 genome 


S14 strain was used for RNA purification and 
RT-PCR amplification as described above. As shown 
tin Fig. IL] we successfully obtained the cDNA frag- 
Gments by genomic terminal RNA RACEJ[] lane 1] 
29] or routine RT-PCR. 


Frise. ob; 
Marker-DL2000] lane 19]5’ RACE[] lanes 2—28]] cloning segments 
using the primer pairs as shown in Table 1] lane 29[] 3’ RACE. 


RT-PCR amplification of S14 whole genomic RNA. Mi] 


y of California Santa Barbara] at 17:58 29 F 


B2. 2 Analysis of nucleotide sequences of SI protein 
oO . . 

genes and deduced amino acid sequence 

= 


S1 hypervariable regions of the two strain¢] M41 
sand S14] were obtained by sequencing] which includ- 
Sed the nucleotides from start coden ATG to the cleav- 
Lage site of S precursor protein. Sl gene of S14 strain 
Zwas 1620 bp longL] encoding 540 amino acids and con- 
Ptaining 19 potential N-glycosylation sites. The G+C 

percentage was 35.49%. Hydrophilic analysis of de- 
duced amino acid sequences of S1 protein showed that 
the first 20 N-terminal amino acid residues of S1 pro- 
tein of S14 virus formed a hydrophobic region] which 
was presumed as the signal sequence of S precursor 
protein. Amino acid sequences at the cleavage site of 
S precursor protein were RRFRR. 


Compared with reference strain M41[] there were 
several insertions and deletions in S14 strainl] a TCT 
insertion at position 67[] ATTA at 216L] ATAATCCA 
at 245[] A at 353[] G at 266[] A at 350[] AG at 360] 
TCT at 362] ATGG at 420 and CT at 429. On the 
other hand[] the deletions of ATTCATGGTGGT at 


position 221—233[] T at 236[] A at 262[] ATGGCC at 
415—421 were identified. There were similar muta- 
tions in LX4 and QXIBV strains[] inserting TC and 
AG at position 360[] respectively. There were also 
some nucleotide substitutions in S14 strain. Com- 
pared with position 25 of M41 strain[] the deduced 
amino acid sequence of S14 had an Asr{] NL insertion 


at this point and Gly] GL] + Set] SU] at position 119. 


Fig.2 shows four groups of infectious bronchitis 
viruseq] IBV[]. Group I could be divided into two 
subgroups[] in which Grayl] Vic SL] JMK strains be- 
long to subgroup A and others belong to subgroup B. 
Group II could also be divided into two subgroupsl] 
with S14{] JX1-99[] and TJ2-96 strains classified to 
subgroup C and other strains to subgroup D. Sub- 
group B strains had high genetic identity with strains 
H52[] H120 and M41. It was speculated that Group I 
viruses were mutated from vaccine virus. All Group 
II viruses were isolates from China including pub- 
lished 
strains. S14 strain was also found in this group. Gx2- 
98 isolate from China of Group III had close identity 
with strain Holte[] and it was deduced that it might 
be mutated from Holte. 


nephrogenic-type and __ proventriculus-type 


Group IV strains were 
proventriculus-type infectious bronchitis viruses iso- 
lated in Chinal] the S1 gene of which was distantly re- 
lated to those of other viruses. 


2.3 Analysis of nucleotide sequences of S2 gene and 
deduced amino acid sequence 


The coding region of S2 protein of the two 
straind] M41 and S14[] was obtained by nucleotide se- 
quencing analysisL] covering the nucleotides from the 
cleavage site of S precursor protein to the stop codon. 
S2 protein coding region of S14 strain was 1878 bp in 
lengthL] which encodes 625 amino acids. The percent- 
age of G+C was 35.52%. S14 strain had 13 N-gly- 
cosylation sites through analysis of glycosylation 
recognition signals. There was a hydrophobic region 
at position 550 to 590 in the C-terminall] which might 
be the connecting region of envelope and nucleocap- 
sid. Compared with the published sequence of M41 
strairL] GenBank accession number] M21883[]] there 
were many nucleotide mutations in S2 protein gene of 
S14 strain. 


Coronavirus strains of Group I were closely relat- 
ed to H52[] M41 and Holte. All Group II viruses[] in- 
cluding published nephrogenic-type and proventricu- 
lus-type strains[] were isolated from China. S14 strain 
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was also in this group. Taken together[] there was 
close genetic identity between Group I strains and 


Group II strains[] while Group III viruses were repre- 


23.6 
20 15 


ae 14 12 10 8 
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sentatives for American respiratory-type and vaccine 


straing] Fig. 3[]. 


HaN1-95* 

H120 

H52 

zj971* 

KB8523 

M41 Group I 
GX1-98* 
HaN2-95* 


BJ* Group I 


Fig. 2. Phylogenetic tree of Sl gene. * IBV isolates from China. 


BJ* 


D1466 
DE072 Group Mi 


4 2 0 


Fig. 3. Phylogenetic tree of S2 protein gene. * IBV isolates in China. 


2.4 Analysis of nucleotide sequences of M protein 
gene and deduced amino acid sequence 


Nucleotide sequences and deduced amino acid se- 
quence of M protein of these two strains were further 
analyzed. The open reading frame of the M protein of 
S14 strain was 678 bp in lengthL] encoding 225 amino 
acids. There were two N-glycosylation sites at the N- 
terminal of S14 strain. The first 11 amino acids of 


S14 strain were hydrophilic and were presumed as the 
outmembrane domain of M protein[] their downstream 
20 to 100 amino acids formed three hydrophobic re- 


gions and were considered as three transmembrane 
domains. 


In comparison with the published sequence of 
H52-GD|] GenBank accession number[] AY044184[[] 
there were many nucleotide substitutions but no in- 
sertion or deletion in M protein gene of S14 strain. 


Based on M gene dendrogram[] Group I could be 
further divided into subgroups A and B. While JL1- 
97[] GX1-98[] DEO72[] HaN2-95[] and HEN4-94 were 
closely related to H52[] M41 belonged to subgroup A. 
D1466|] H120]3] SAIBwjl] SD1-97[] SC2-93 and 
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Cal99[] which had the highest genetic dentity with 
H120[] belonged to subgroup B. S14 belonged to 


8.4 
8 6 4 


Fig. 4. 
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s2.5 Analysis of nucleotide sequences of N protein 
S 
Sgene and deduced amino acid sequence 


Nucleotide sequence and deduced amino acid se- 
“quence of N protein gene of two strains[]M14 and 


alifornia 


OS14U] were obtained by sequencing. The open reading 
cframe of N protein of S14 strain was 1230 bp in 
“AlengthL] encoding 409 amino acids. There was one N- 
glycosylation site at the N-terminal of S14 strain[] 
Swith another in the middle. N protein was hy- 
zdrophilicwhich may be beneficial for the interaction 


sof N protein and RNA. 


nloade 


In comparison with the published sequence of 
SM 10 GenBank accession number[] M28566[[] the N 
aan gene of S14 strain had nucleotide substitutions 
but no insertion or deletion. 


Based on the analysis of N protein genel] Group I 
could be further divided into subgroups A and BI] of 
which JL1-97[] SAIB14 and SAIBwj closely related to 
H52 belonged to subgroup A and SAIB20[] GX1-98[] 
GX2-98 which had the highest identity with Gray and 
Ark99 belonged to subgroup B. Group II could also 
be divided into C and D subgroups] of which SC2-93[] 
SD1-97 closely related to D1466 belonged to subgroup 
C and HaN2-95[] GD1-87 which had the highest iden- 
tity with H120 belonged to subgroup D. S14 strain 
had the highest identity with QXIBV and belonged to 
Group II] Fig. 51. 


group III[] and was also closely related to Groups I 
and II but distantly to Group IV] Fig. 4[. 


h52-gd 

JI1-97* 

2x 1-98* 

DE072 

HaN2-95* 
HeN4-94* 

m41 Group I 


D1466 


es iF 
qxibv* 


Group IV 
Cu-T2 
2 0 


Phylogenetic tree of M protein gene. * IBV isolates in China 


3. Discussion 


As an RNA virus] IBV is prone to genetic muta- 
tion and recombination during the process of replica- 


tion|] resulting in the emergence of new mutant 
strains with different serotypes. Thereforel] it is diffi- 
cult to develop effective vaccines against IBV infec- 
tion. As a result[] avian infectious bronchitis still pos- 
es significant economic threat to the poultry industry. 
In our previous work[] we obtained four coronavirus 
isolates from wild partridge] one of which is designat- 
ed as Partridge coronavirus S14. 


Phylogenitic analysis of the whole genome of S14 
helped us find out the origin of the virus. By analyz- 
ing the structural protein gened] SIL] S2L] M and NU 
of S14[] S14 was clustered with Group II viruses by 
S1 gene[] the viruses in this group are all nephrogenic- 
type and proventriculus-type IBVs isolated from Chi- 
na. S14 had a high identity with nephrogenic-type 
strains JX1-99[] TJ2-96[] QXIBV[] LX4 and BJ. 
While S2 gene of S14 strain had a much higher genet- 
ic identity with the Chinese proventriculus-type 
strains QXIBV and LX4[] which was 97.2% and 
90.6%[] respectively. They formed a subgroup II 
with another Chinese IBV isolate BJ. Phylogenetic 
analysis of M gene showed that S14 belonged to 
Group III and had a higher genetic identity with the 
Chinese isolates SAIB20[] GD6-98 and LX4 than that 
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of BJ and QXIBV. Analysis of N gene grouped S14 to 
Group III which included the Chinese IBV strains 


9.7 


f California Santa Barbara] at 17:58 29 February 2016 


It was considered that S2 and N genes of IBV 
Owere conserved. In our research] the S2 and N genes 
“sof S14 were closely related to the Chinese IBV isolates 
such as QXIBV[] LX4 and BJ[] so was the S1 gene. 
However] it was not the case for M gene. Analysis of 
2M gene showed that the M gene of S14 was closely 
grelated to Iowal] SAIB20[] GD6-98 and LX4 but very 
Zdistantly to BJ and QXIBV. Since S1[] S2 and N 


agenes of S14 were closely related to those of Chinese 


y 


Zi solates QXIBV and BJ[L] whose sequences were all 
distantly related to those of strains from other coun- 
triesL] QXIBV and BJ might be the virulent isolates in 
China where S14 was originated. As far as M gene 
was concerned[] S14 was not closely related to vaccine 
strains H52 and H120 which were widely used in 
China. Thereforel] it was predicted that the evolution 
of S14 was also related to the mutation of vaccine 
strains. Some new IBV strains in China and other 
countries might be the recombinants of vaccine strains 
and field strain$ 4. Further study should be carried 
out to determine whether S14 was the recombinant of 


a ee 
peer . x* 
GD6-98* 


wanna nnn nnn nnn nnn TW97-4* 


QXIBV[] LX4 and BJ. 


JL1-97* 
h52 
SAIB14* 
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r 
age Group | 
GX1-98* 
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* 
ah Group I 
HaN2-95* 
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S14* 
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Fig. 5. Phylogenetic tree of N protein gene. * IBV isolates in China. 


IBV isolates in China and introduced vaccine strains. 
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